Symbolic Speaker Adaptation for Pronunciation Modeling
نویسندگان
چکیده
This paper presents a method of modeling a speaker’s pronunciation of a given language as a blend of ”standard” speech and other non-standard speech varieties (regional dialects and foreign accented pronunciation styles) by way of speaker-dependent modification of a lexicon. In this system, a lexicon of Standard American English (SAE) forms, the ”canonical” lexicon, is filtered and transformed via a group of speech variety (SV) dependent rule sets into a speaker specific set of pronunciation variants (and associated probabilities) for use during recognition. The relative importance of these rule sets depends on the speaker’s pronunciation characteristics and is represented by a Speech Variety Profile (SVP) associated with each speaker. A speaker’s individual SVP is acquired through feedback from an adaptation process. Convergence to a speaker’s SVP represents adaptation of the lexicon (symbolic adaptation) to those SVspecific forms that speaker is likely to utter.
منابع مشابه
Improving pronunciation modeling for non-native speech recognition
In this paper, three different approaches to pronunciation modeling are investigated. Two existing pronunciation modeling approaches, namely the pronunciation dictionary and n-best rescoring approach are modified to work with little amount of non-native speech. We also propose a speaker clustering approach, which capable of grouping the speakers based on their pronunciation habits. Given some s...
متن کاملSymbolic speaker adaptation with phone inventory expansion
This paper further develops a previously proposed adaptation method for speech recognition called Symbolic Speaker Adaptation (SSA). The basic idea of SSA is to model a speaker’s pronunciation as a blend of speech varieties (SVs) regional dialects and foreign accents for which the system has existing pronunciation models. The system determines during an adaptation process the relative applicabi...
متن کاملPronunciation and Acoustic Model Adaptation for Improving Multilingual Speech Recognition
In this paper, we address the importance of pronunciation and acoustic model adaptation in multilingual speech recognition. When aiming at modeling several languages simultaneously, the degree of speaker and language variability is even greater than when concentrating on only one language. To compensate the pronunciation variability across various speaker, bi-lingual pronunciation modeling is p...
متن کاملUnsupervised Pronunciation Adaptation for Off-line Transcription of Japanese Lecture Speeches
Observing that most variations in pronunciation are strongly speaker and speaking style dependent, and that the introduction of pronunciation variants in a speaker-independent recognition system is of limited success, we refrain from applying multiple pronunciation variants in the speakerindependent case and instead introduce pronunciation variants without supervision when specializing the reco...
متن کاملImproving speech recognition for children using acoustic adaptation and pronunciation modeling
Developing a robust Automatic Speech Recognition (ASR) system for children is a challenging task because of increased variability in acoustic and linguistic correlates as function of young age. The acoustic variability is mainly due to the developmental changes associated with vocal tract growth. On the linguistic side, the variability is associated with limited knowledge of vocabulary, pronunc...
متن کامل